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Abstract —We consider a single-cell downlink massive MIMO 
communication in the presence of an adversary capable of jam¬ 
ming and eavesdropping simultaneously. We show that massive 
MIMO communication is naturally resilient to no training-phase 
jamming attack in which the adversary jams only the data com¬ 
munication and eavesdrops both the data communication and the 
training. Specifically, we show that the secure degrees of freedom 
(DoF) attained in the presence of such an attack is identical to the 
maximum DoF attained under no attack. Further, we evaluate 
the number of antennas that base station (BS) requires in order 
to establish information theoretic security without even a need 
for Wyner encoding. Next, we show that things are completely 
different once the adversary starts jamming the training phase. 
Specifically, we consider an attack, called training-phase jamming 
in which the adversary jams and eavesdrops both the training 
and the data communication. We show that under such an attack, 
the maximum secure DoF is equal to zero. Furthermore, the 
maximum achievable rates of users vanish even in the asymptotic 
regime in the number of BS antennas. To counter this attack, we 
develop a defense strategy in which we use a secret key to encrypt 
the pilot sequence assignments to hide them from the adversary, 
rather than encrypt the data. We show that, if the cardinality 
of the set of pilot signals are scaled appropriately, hiding the 
pilot signal assignments from the adversary enables the users 
to achieve secure DoF, identical to the maximum achievable 
DoF under no attack. Finally, we discuss how computational 
cryptography is a legitimate candidate to hide the pilot signal 
assignments. Indeed, while information theoretic security is not 
achieved with cryptography, the computational power necessary 
for the adversary to achieve a non-zero mutual information 
leakage rate goes to infinity. 

I. Introduction 

Massive MIMO is one of the highlights of the envisioned 
5G communication systems. In massive MIMO paradigm, the 
base station is equipped with a number of antennas, typically 
much larger than the number of users served. Combined 
with a TDD-based transmission, this solves many of the 
issues pertaining channel state information. In particular, the 
base station exploits law-of-large-numbers-like certainties as 
it serves each user over a combination of a large number of 
channels. 

While many issues behind the design of multicellular mas¬ 
sive MIMO systems have been studied thoroughly, security 
of massive MIMO has not been actively addressed. Part of 
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the reason for this may be the fact that, there is a vast 
literature on the security of MIMO systems in general, and 
a common perspective is that massive MIMO is merely an 
extension of MIMO as it pertains to security. However, we 
demonstrate that massive MIMO has unique vulnerabilities, 
and standard approaches to MIMO security do not address 
them directly. Instead, these approaches focus on issues that 
massive MIMO is naturally immune to. Furthermore, we argue 
that, common models used in MIMO security eliminate the 
need to think on various components of the system that are 
critical to understanding the vulnerabilities in security. In 
particular, in massive MIMO, merely making assumptions on 
available channel state information (CSI) is not sufficient, 
since the actual technique the system uses to obtain CSI may 
be the lead cause for some major security issues. For all 
these reasons, security of massive MIMO calls for a separate 
treatment of its own. 

To that end, we consider the TDD-based single cell down¬ 
link massive MIMO system developed in fS) and later read¬ 
dressed in IT]. The adversary is hybrid, capable of jamming 
and eavesdropping at the same time with its multiple antennas 
and we call our system secure if secrecy, measured in full 
equivocation is achieved at the adversary and arbitrarily low 
probability of decoding error is achieved at the legitimate re¬ 
ceiver. We refer to these requirements as security constraints. 
We first show how massive MIMO is naturally resilient to 
standard jamming and eavesdropping attacks, unless jamming 
is performed during the training phase when pilot signals are 
transmitted by the mobile users. We prove that, without pilot 
jamming, the achievable secure degrees of freedorr{jDoF) is 
identical to the maximum DoF attained under no attack, even 
without the need to use a stochastic (e.g., Wyner) secrecy 
encoder in the massive MIMO limit. On the other hand, as 
we will show, the adversary can reduce the maximum secure 
DoF and rate to zero by contaminating the pilot signal of the 
targeted user via another correlated pilot signal. To address 
this attack, we develop a defense strategy in which the base 
station (BS) keeps the assignment of pilot signals to the users 
hidden from the adversary and informs the assignments to 
the users reliably. Thus, in our approach, we use computa¬ 
tional cryptography for encrypting the pilot assignments in 
the training phase. We also discuss how the consequences of 
encryption of pilot assignment is fundamentally different from 
the consequences of data encryption. In particular, we argue 

*Our definition of degrees of freedom is different from the standard 
definition. Our definition specifies how the achievable rate scales with the log 
of the number of base station antennas, rather than the log of the transmission 
power as in the standard definition. 


that, even if we use non-information theoretic methods (e.g., 
Diffie-Hellman) to encrypt the pilot assignments, the level of 
security we achieve can be as strong as information theoretic 
secrecy for all practical purposes. Note that, most of our results 
are not asymptotic in the number of antennas and we specify 
the number of antennas necessary to achieve certain level of 
security. 

The major ideas developed and demonstrated in this paper 
include: 

• In information-theoretic secrecy literature, it is often the 
case that assumptions are made on the CSI available at 
the adversary. Typically, it is assumed that the adversary 
has access to the CSI for all channels in the system, with 
the motivation of making the achievable security robust 
with respect to the availability of CSI at the adversary. 
However, we show that, with massive MIMO, it is not 
important if the adversary has full CSI or not. Indeed, 
we show that massive MIMO is naturally immune to 
attacks during data communication phase. Instead, we 
demonstrate that the major question is how the adversary 
obtains CSI. In particular, we show that if the adversary is 
active during the training phase, it substantially degrades 
the security of data communication. 

• Security in computational cryptography is based on the 
assumptions on the computational power of the attackers. 
Once data is encrypted, it takes an unreasonable amount 
of time for a typical adversary to decrypt it without the 
key. Making such an assumption on the adversary poses 
a problem for security, since a sophisticated adversary 
can use various tools and techniques to cut down the 
time for cryptanalysis applied to recorded encrypted data. 
We eliminate this shortcoming by encrypting the pilot 
assignments -not the transmitted data,- using keys that 
are shared via standard Diffie-Hellman. In our scheme, to 
make an impact, the adversary needs to decrypt the pilot 
assignment before the training phase starts. Note that, the 
training phase can start immediately after the assignments 
are made, leaving an arbitrarily low amount of time for 
the adversary to crack the assignment (i.e., pushing the 
computational power necessary to infinity). Without the 
knowledge of the pilot assignment, our scheme achieves 
perfect secrecy of information transmitted in the data 
communication phase, even without the use of a secrecy 
encoder. Thus, it is useless for the adversary to record 
the received signal for future cryptanalysis, since it is 
indifferent from noise. 

Next, we summarize the technical contributions of our paper. 
Throughout the paper, we assume that the adversary is full- 
duplex, i.e., it is capable of eavesdropping and jamming the 
BS-to-user communication simultaneously. In the first part of 
the paper, we study an attack model in which the adversary 
eavesdrops the entire communication between the BS and 
users and jams only the downlink data communication (the 
adversary keeps silent during the training.). Under this attack: 

• We show that the maximum secure DoF is identical 


to the maximum DoF achieved in the presence of no 
adversary. 

• We provide a novel encoding strategy, 5-conjugate beam¬ 
forming, that provides full security, without the need for 
Wyner encoding Ho). 

• We evaluate the number of antennas that the BS requires 
in order to satisfy the security constraints. 

The proposed encoding, 5-conjugate beamforming, utilizes the 
fact that the correlation between the estimated BS-to-user 
channel gains and the BS-to-adversary channel gains becomes 
zero when the adversary does not jam during the training 
phase. We observe that in order to cause a non-zero correlation 
between the estimated BS-to-user channel gains and the BS- 
to-adversary channel gains, the adversary has to jam the pilots 
of users. 

In the second part of the paper, we consider an attack 
model in which the adversary eavesdrops and jams the entire 
communication (including the training) between the BS and 
the users. Under this attack: 

• We show that, if the adversary jams the training such 
that there exists a non-zero correlation between the BS- 
to-adversary channel gain and the estimated gain of the 
channel from the BS to a user, the adversary reduces the 
maximum secure DoF to zero. Further, we show that, 
if the amount of the correlation is sufficiently large, the 
maximum achievable rate of the user also vanishes as the 
number of antennas at the BS grows. 

• We propose a counter strategy against the adversary. We 
show that, if the cardinality of the set of pilot signals 
scales with the number of antennas at the BS and the BS 
is able to keep the pilot signal assignments hidden from 
the adversary, attained secure DoF is arbitrarily close to 
the maximum DoF attained under no attack. 

Related Work: Massive MIMO concept was first proposed 
in a, 11. Since then, there has been a flurry of studies 
focusing on different aspects of massive MIMO (see sur¬ 
vey 0) such as channel estimation, energy efficiency, and 
pilot contamination. However, while MIMO security has been 
an active area of research 0, Q, 0, issues specific to 
massive MIMO have not been considered. Among the very 
few, in a, the authors consider downlink multi cell massive 
MIMO system in the presence of an adversary that only 
eavesdrops. In order to confuse the adversary, the BS transmits 
artificial noise from a set of its antennas. The authors conclude 
that, if the adversary has sufficiently large number of antennas, 
it is impossible to operate at a positive rate with artificial noise 
generation at the BS. In our earlier work a, which sets up 
the main results in this paper, we have focused on a fairly 
different model and addressed other questions. For instance, 
our attack model considers both jamming and eavesdropping, 
possibly simultaneously by the adversary. 

II. System Model and Problem Statement 

We consider a multi user MIMO downlink communication 
system, depicted in Figure including a base station (BS), 
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Fig. 1. System Model 


K single-antenna users, and an adversary. The BS equipped 
with M antennas wishes to broadcast K distinct messages 
[Wi ,..., Wk] each of which is intended for a different user. 
The adversary is equipped with Mf. antennas. 

A. Channel Model 

We assume all the channels in our system, illustrated 
in Figure [T] are block fading. In the block fading channel 
model, time is divided into discrete blocks each of which 
contains T channel uses. The channel gains remain constant 
within a block and the channel gains on different blocks 
are independent and identically distributed. Furthermore, we 
assume the channels are reciprocal; the instantaneous gain of 
the channel connecting the BS to a user is as same as the gain 
of the channel connecting to the same user to the BS. 

We follow a TDD-based two-phase transmission scheme 
introduced in im and is re-adressed in IT]. The signal 
transmission in a block is separated into two phases; training 
phase and data communication phase. On the first Tr channel 
uses of every block, each user sends a pilot signal to the BS. 
The BS estimates each BS-to-user channel from the observed 
pilot signals. On the last Td channel uses of each block 
(Td = T — Tr), the BS transmits data to the users. 

The observed signals during a data communication phase at 
fc-th user and at the adversary at a particular channel use of 
i-th block are as follow^ 

Yk = Hk(l)X + Hjam,k{'i)Vjam + Vk (1) 

Z = H,ii)X + K, (2) 

where is a received complex signal at fc-th user, Z is 
a received x 1 complex vector at the adversary, and X 
denotes M x 1 complex vector of transmitted data symbols. 
Signals 14 and 14 are additive Gaussian noise components, 
distributed as CiM(0,1) and CX(0,1^^), respectively. Signal 
Vjam denotes Me x 1 complex vector of jamming signal. 
Further, Hk{i) and Hjam,k(i) denote a 1 x M complex 
gain vector of the channel connecting the base station to 
fc-th user, a 1 x Mg complex gain vector of the channel 

^Except for the channel gains, we avoid the block and channel use indices 
in |T} and and the block indicies in and for the sake of simplicity. 


connecting the adversary to A:-th user, respectively, at i- 
th block. Similarly, Hg{i) is the Me x M complex gain 
matrix of the MIMO channel connecting the base station 
to the adversary at i-th block. We assume that all channel 
gains He{i), ... HK{i), ■ ■ ■, Hjam,K{i) are 

mutually independent for any i > 1. 

The users send pilots in the first Tr channel uses of each 
block. The received signals at the BS and at the adversary in 
the training phase of i-th block are as follows: 

K 

yT =Y^HI (i)4fc + Hj + W, (3) 

k^l 

K 

=T.^lam,k{i)^k + W,, (4) 

fe=l 


where and denote M x Tr and Me x Tr complex 
matrices of the received signals over Tr channel uses at the 
BS and at the adversary, respectively. Signals W and 144 
are M x Tr and M x Tr complex matrices denoting the 
additive Gaussian noise. Each element of W and 144 are i.i.d 
634(0,1). Signal Vjam denotes Me x Tr complex matrix of 
jamming signal. Signal 4^ is 1 x Tr complex vector denoting 
the pilot signal associated with A:-th user. The power of pilot 
signals pr, i.e., 7jr\i {4i^(j)k) = Pr is identical for all users 
fee {!,...,iT}. 

We assume that the users do not have the knowledge of the 
BS-to-user channel gains. Note that the BS, the users, and the 
adversary know pilot signal set ..., 4>k\- The adversary is 
assumed to be aware of which pilot signal is assigned to which 
user. Utilizing the pilot signals, the BS estimates the BS-to- 
user channel gains. Define Hk{i) as 1 x M complex vector of 
estimated BS-to-fc-th user channel gain. Further, for any B > 
1, define , H^, , and as the gains of the BS-to- 

user channels, the estimated gains of the BS-to-user channels, 
the gains of the BS-to-adversary channel, and the gains of the 
adversary-to-user channels over B blocks, respectively, i.e.. 


^ [Hi 

tB 


B 




RB a 


ttB ttB 

Hi ,..., Hj^ 


and 


For any B > 1, the joint probability density function of 

f ttB ttB ttB ttB 

j rig ,Bjam 


n ~ ( 


Uph, 

2=1 




z),/i('i),/ig(i),/ijfjm (^)) (5) 


where H = [Hi, ..., Hk], H = Hi,..., Hk , and Hjam = 
Hjam,K]- For any k S Ff}, Hk and 

Hjam,k are distributed as 634(0, Jm), 634(0,/Me), respec¬ 
tively, and each element of matrix i/g is i.i.d 6A^(0,1). 

The adversary has the perfect knowledge of the BS-to-user 
channel gains H and the estimated BS-to-user channel gains 
H. Define Hk^ and Hk^ as the gain and the estimated gain 
of the channel connecting m-th BS antenna to fc-th user. We 
assume that for any k G {1,... ,H}, {//fc„FTfe„}m>i forms 










an i.i.d process. We also assume that are independent with 
Hi and E 


X • ■ 


HkH 


= 0 for k ^ I and k,l G {1,..., K}. 
Note that we do not impose these assumptions for the BS-to- 
adversary channels. 

Remark 1. When MMSE estimator and mutually orthogonal 
pilot signals are employed at the BS for channel estimation, 
these assumptions are satisfied. 

B. Attack Model 

We consider a full duplex adversary that is capable of 
eavesdropping and jamming simultaneously. In the sequel, we 
consider two attack models that differ only in the adversary’s 
jamming activity in the training phase. 

In Section ||n] we consider an attack model in which the 
adversary jams only during the data communication phase 
and eavesdrops both the training and the data communication 
phases. We call this attack model as no training-phase jam¬ 
ming. In the no training-phase jamming, the adversary jams 
during the communication phase using a Gaussian jamming 
signal and keeps silent during the training phase. Specifically, 
signal Wjam in 0 is identical to zero and jamming signal 
Vjam in 0 is distributed as e'N{0, pjamlMj, where pjam is 
the jamming power. 

In Sections |lV] and |V) we consider an attack model in 
which the adversary jams and eavesdrops both the training 
and the data communication phases. We call this attack model 
as training-phase jamming. The adversary strategy during the 
data communication phase in this attack model is the same as 
that described in the previous attack model (i.e., no training- 
phase jamming). Instead of jamming with random signals, 
the adversary jams during the training phase with structured 
signals. We provide a detailed description of the signals used 
for jamming the training phase in Section IV and|V] 


C. Code Definition 

The BS aims to send message Wk G W^., k = 1,...,K, 
to fc-th user over B blocks with rate Rk, while keeping Wk 
secret from the adversary. The BS and the users employ code 
^ 2 BTRi ^ ^ ^ BTf) of length BTd, that contains: 

1) K message sets, W^. = {!,..., 2®^'^'=}, k = \,... ,K. 

2) K injective encoding functions, fk, k = 1,... ,K, where 
fk maps Wk G Wk to data signal sequence 

satisfying an average power constraint such that 


3) Linear beamforming that maps data signal^ sf 

to channel inpuj^ . Two beamforming strategies 
are used throughout the paper: 

• Conjugate beamforming: When the BS employs conju¬ 
gate beamforming, channel input at j-th channel use of 
*-th block can be written as 




fc=l 


( 8 ) 


B} and j G {Tt- -|- 1,..., T}, where 


for any i G {1,. 

ak = E[\HkJ^ 

^-conjugate beamforming: We introduce a new beam¬ 
forming strategy, called J-conjugate beamforming that is 
slightly modified version of conjugate beamforming. Let 
^ be a positive real number. When the BS employs S- 
conjugate beamforming, the channel input at j-th channel 
use of i-th block can be written as 




(9) 




Note that, when <5 = 0, (^-conjugate beamforming be¬ 
comes identical with conjugate beamforming in 0. 

4) Decoding functions, gk, k = 1,...,K, where gk maps 
to Wk G Wk. 


D. Figures of Merit 

We define the average 
,..., 2®™^, BTd) as 


error probability of code 


K 


n = P U OkiY^^'^) ^ Wk 


\k=l / 

where Wk is uniformly distributed on Wk. We assume that 
the adversary targets a single user during communication. The 
secrecy of the transmitted message for fc-th user is measured 
by the equivocation rate at the adversary, which is equal to 
the entropy rate of transmitted message Wk conditioned on 
the adversary’s observations. 

Definition 1. A secure rate tuple Ri,...Rk is said to be 
achievable if for any e > 0, there exists B{e) > 0 and a 
sequence of codes , ■ ■ ■, 2^^^^, BTd) that satisfy the 

following: 


^ \sk{i,j)\^ < Pk, k = l,...,K (6) 

for all Wk G Wk, where notation (i,j) indicates the j-th 
channel use of i-th block, pk denotes the power constraint 
for fc-th user, and Sk{i,j) is the complex data signal of fc- 
th user. Note that Pf = ^k=i Pk the cumulative average 
transmission power. Further, note that encoding functions, 
fk, k = 1,...,K can be deterministic or stochastic. Codes 
using stochastic encoding functions referred to as stochastic 
codes and the ones using deterministic encoding functions are 
referred to as deterministic codes. 


Pe < e, ( 10 ) 

^H(Wk\Z^^,H^,H^,H^)>Rk-e ( 11 ) 

^Note that = {skii,j)}i=i:B,j=T,.+i:T and notation (j^^a 

applied to any variable has the same meaning. 

^Note that the channel input sequence satisfies the following average power 
constraint ^ ^ 

4x12 tl ^[\\X{i,j)\\^]<Pf (7) 

d- i=l j=Tr + l 

for all X • • • X ^ x • • • x Wpf, where the expectation is over 
estimated channel gains H. The inequality jTJ follows from the individual 
power constraint ID and from the fact that = 0 for k ^ I 












for all B > B{e) and k G {1,..., K}, where is the 

received signal sequence at the adversary over BT channel 
uses. 


We refer to the constraints in ( [TOl i and ( [TT] i as decodability 
and secrecy constraints, respectively. We also refer to both 
constraints as security constraints. We call the communication 
system information theoretically secure if both constraints are 
satisfied. Notice that the achievable rate tuple definition above 
is presented for a given M, i.e., M remains constant for a 
sequence of codes , BTd), B > B{e). 

In this paper, we mainly focus on the massive MIMO 
limit. Specifically, we study on how achievable rate tuple 
Ri,..., Rk behaves as M goes to infinity. To that end, we 
use the following notion of degrees of freedom for each user. 


Definition 2. A secure degrees of freedom tuple di,... jdx 
is said to be achievable, if there exists achievable rate tuple 
i?i,... Rk such that 

Rk 


dk = lim 


M->oo log M ’ 


k = 1, 


,K. 


( 12 ) 


In the literature, degrees of freedom is typically defined as 


the limit lim„ 


Since we aim to understand how 


Pfc—foo iQg ■ 

changes with M under constant p^, the degree of freedom 
definition in ( fTSl i is more relevant for our interest. 

For a given achievable secure degrees of freedom tuple 
di,... ,dK, we define the secure degrees of freedom of the 
downlink communication as the minimum value in the tuple, 
i.e., secure DoF = min^g^i dk- In the rest of the 
paper, when we use secure DoF, we mean secure degrees 
of freedom attained in the presence of an adversary, and when 
we use DoF, we mean degrees of freedom attained under no 
adversary. 

In this paper, we characterize the maximum secure DoF 
in the presence of various security attacks described in Sec¬ 
tion II-B Furthermore, we aim to develop defense strategies 


that achieve the maximum secure DoF against the security 
attacks that would limit the maximum secure DoF to zero, 
otherwise. 


III. Adversary not jamming The Training Phase 
In this section, we show that downlink communication in a 
single-cell massive MIMO system is resilient to the adversary 
that jams only the data communication phase and eavesdrops 
both the communication and training phases. We show that 
the maximum secure DoF attained under no training-phase 
jamming is identical to maximum DoF attained under no 
adversary. Then, we show that we can establish information 
theoretic security without using stochastic encoding, e.g., 
Wyner encoding. Finally, we evaluate the number of antennas 
that BS needs to satisfy the security constraints without a need 
for Wyner encoding. 

A. Resilience of massive MIMO 

In this subsection, we evaluate the maximum secure DoF 
of the downlink communication in the presence of no training- 
phase jamming. Then, we show that the maximum secure DoF 


attained in the presence of no training-phase jamming is as 
same as the maximum DoF attained without an adversary. 
This result demonstrates the weakness of the no training-phase 
jamming in the massive MIMO limit. 

Theorem 1. (Maximum secure DoF) For given block length 
T and data transmission phase length T^, the maximum 
secure DoF under no training-phase jamming is given by 


The complete proof is available in Appendix where we 
first provide an upper bound on secure DoF and then present 
a strategy to achieve the upper bound. Here, we provide a 
proof sketch. In order to find an upper bound on secure 
DoF, we consider a multiple output single output (MISO) 
communication system without an adversary, in which the 
BS communicates to a single user under power constraint pf. 
Further, we assume that the BS and the user have a perfect 
information of the channel gains. We show that the supremum 
of achievable rates leads to a secure DoF of Hence, we 
conclude that ^ is an upper bound on secure DoF attained in 
the multi user downlink communication model in Section [II] 

We now describe a strategy to attain the maximum secure 
DoF in Theorem[2 On the first Tr channel uses of each block, 
the users send pilot signals that are mutually orthogonal. 
The BS uses minimum mean square estimator (MMSE) to 
estimate the BS-to-user channel gains. The BS constructs K 
codebooks, Ck, k = 1,...,K, where codebook Ck contains 
2 ^TRk independently and identically generated codewords, 
of length BT^ and Rf^ > R^. The BS maps fc-th user’s 
message to a codeword with a stochastic mapping function 
fk. Specifically, the BS maps message Wk G {l,. ■., 2^^^'=} 
to randomized message rrik G 2^^^*=} as in Qo) and 

then maps randomized message rrik to one of the codewords 
in Ck, k = 1,... ,K. Utilizing the conjugate beamforming 
in ([^, the BS maps K codewords, k — 1,...,K to 

channel input sequence Each user employs typical set 

decoding na. In order to show that secrecy constraint 
for a particular user is satisfied, we give the adversary the 
other users’ transmitted codewords. □ 

In the next couple of remarks, we emphasize the robustness 
of the downlink communication system against no training- 
phase jamming. 

Remark 2. (The weakness of the adversary not jamming 
the training phase) In the proof of Theorem IT] we show that 

rp l-J 

^ is indeed an upper bound on the DoF of a downlink 
communication without the presence of an adversary. Hence, 
with also showing that the secure DoF of ^ is attained in 
the presence of the adversary, we conclude that no training- 
phase jamming attack does not degrade the performance of the 
communication in terms of DoF. The reason that secure DoF 
of ^ is achieved is that the adversary keeps silent during the 
training phase; hence the estimated BS-to-user channel gains 
are independent with Hg. 






Fig. 2. The variation of R^. with M and Me 


In the next section, we consider an adversary jamming the 
training phase. In the presence of such an adversary, the BS- 
to-user channel gains become correlated with and the 
maximum secure DoF is reduced to zero. 

Remark 3. (Resource race between the BS and the 
adversary) In Appendix ^ we show that the achievable 
rate tuple that leads to a secure DoF of ^ is = 

^ log (i+^ log (1+MePfc), k = i,...,K, 
where a = . 

prTr~\~A 

We next investigate how Rk varies in Me and M. Figure 
illustrates this variation when = 1, pf = 10, ^ = 0.99, 
Pjam = titid a = 0.9. As seen in Figure^ in the presence of 
the adversary not jamming the training phase, the achievable 
secure rates are determined as a result of the arms race 
between the adversary and the BS. Specifically, we can 
observe that if Mg remains constant, achievable rate Rk grows 
unboundedly as M is increasing. Moreover, for a fixed value 
of M, the achievable rates decrease as a function of Mg. 
In the next section, we consider an adversary jamming the 
training phase instead of keeping silent during the training 
phase. We will show that, armed with only a single antenna, 
the adversary is capable of limiting the maximum achievable 
rate for any user to zero as M ^ oo. Hence, by jamming the 
training phase, the adversary converts the arms race between 
the BS and itself to the one between an user and itself. 

B. Establishing security without Wyner encoding 

In the achievability strategy given in the proof sketch of 
Theorem [T] we use a stochastic encoding, a randomized 
mapping of each message to a codeword with stochastic 
functions, at the BS. In fact, stochastic encoding, e.g., Wyner 
encoding m, is a standard technique in the literature for 
establishing information theoretic security against the eaves¬ 
dropping attacks. 


In this section, we show that the BS utilizing determin¬ 
istic encoding, a nonrandom mapping of each message to a 
codeword with deterministic functions, instead of stochastic 
encoding is capable of satisfying the security constraints if it 
is equipped with sufficiently large number antennas. In order to 
satisfy the security constraints without using stochastic encod¬ 
ing, the BS employs novel beamforming strategy introduced 
in (|^. 

The following theorem shows that, when code 

of length BTd utilizes S- 
conjugate beamforming instead of conjugate beamforming 
in (|8]l, the code satisfies the secrecy constraint in ( [TT] ) for 
any k G {1,.. . and for any e > 0 without a need for 
stochastic encoding. 

Theorem 2. (Establishing secrecy with no stochastic en¬ 
coding) Let 5 > 0. Under no training-phase jamming, 
for any e > 0, if M > S'(e), then any determinis¬ 
tic code ,..., 2^^^^, i?Td) employing 6-conjugate 

beamforming satisfies 

^H(Wk\Z^^fH^,H^,H^)>Rk-e (13) 

for all B > 1 and for all k G {1, ..., iT}, where 



and Pmax — pk. 


We can consider S{e) in Theorem]^ as the number of the 
antennas the BS needs in order to make the conditional entropy 
e-close to Rk for a\\ k G {1,... ,K). Hence the BS equipped 
with at least ^(e) antennas can satisfy ( [T3] l by harnessing any 
code (2®"^^!,..., 2^^^^, BTd) that employs deterministic 
encoding functions and (^-conjugate beamforming. 

The proof is available in Appendix B-A| The BS constructs 
K codebooks, Ck, k = 1,... ,K, where codebook Ck contains 
2 ^TRk codewords, of length BTd. The BS maps message 
Wk to codeword with a deterministic function, fk, k = 
1,...,K. Utilizing the (5-conjugate beamforming in ([^, the 
BS maps K codewords, fc = 1,..., AT to channel input 

sequence 

In Figure]^ we illustrate the variation of S{e) with e when 
Pk = 1, 6 — 0.7, T/Td = 5/4, and Mg = 1. As seen in 
Figure 100 antennas at the BS are sufficient to make the 
equivocation rate above Rk — 0.05 for any choice of Rk, k = 


Theorem evaluates the number of antennas needed in 
order to satisfy only the secrecy constraint. The following 
corollary takes both the secrecy and decodability constraints 
into account. 


Corollary 1. (Any rate tuple is achievable with no need 
to stochastic encoding) Let Q < 5 < 1. In the presence of no 
training-phase jamming, for any e > 0 and any rate tuple R = 
[i?i,... ,Rk], if M > max (U(i?), S'(e)), there exists B{e) > 












Fig. 3. The variation of S{e) with e when pf. = 1, S = 0.7, T/Td = 5/4, 
and Me = 1. As long as M > 5(e), 
remains e-neighborhood of for any k S 


0 and sequence of codes ,... , BTfj , B > 

B{e) that satisfy the constraints in and ( [TT] i without the 
use of stochastic encoding, where 

V{R)^ max (-l) x Pf + Pi---+ _ 

ke{i,...,K} \\ ) apk J 
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(5 

Fig. 4. The variation of max (y(i?), 5'(e)) with (5 when e = 0.05, TjTd = 
5/4, Me = 1, pk = 1, and Ry. = 0.2 for any k G {1,..., K}. As long 
as M > max(5(€), V^(i?)), constraints in (Tg and (m are satisfied for a 
given e and R without a need for stochastic encoding. 

= lim - -r— —J -^ 

Rk^OO Jgg _ ij 

= ( 16 ) 

for all k € {1,..., K}, for any e > 0 and for any {Ri}i^k- 
The equalities in Cl and d in the above derivation follow 
from the fact that max(V(i?), S'(e)) and V{R) are increasing 
functions of Rk- Wfe observe from d that by choosing 6 
close to 0, we can make the difference between d and 
the maximum secure DoF provided in Theorem arbitrarily 
small. 


The proof of Corollary [T] can be found in Ap¬ 
pendix B-B The sequence of codes in Corollary utilizes 
(5-conjugate beamforming. Figure illustrates the variation 
of max(V(i?),S'(e)) with d when e = 0.05, T/Td = 5/4, 
Me = 1, Pfc = 1, and Rk = 0.2 for any /c G K}. Note 

that, for these parameters, max (V(i?), S'(e)) is minimized 
and identical to 60 when 5 — 0.78. When the BS utilizes 
(5-conjugate beamforming with 5 = 0.78, the BS requires at 
least 60 antennas in order to satisfy the constraints in ( [T0| l and 
d without the need for a stochastic encoding (e.g., Wyner 
encoding). 


Remark 4. (Achieving secure DoF arbitrarily close the 
maximum DoF with no Wyner encoding) Theorem and 
Corollary show that it is possible to establish information 
theoretic security without using stochastic encoding. We next 
measure the amount of DoF sacrificed as a result of not 
utilizing stochastic encoding. To that end, we evaluate how 
number of antennas at the BS max(V(i?), S'(e)) scales with 
Rk for given e > 0 and {Ri}i^k- Specifically, we calculate 

log max(rifl).5(e)) 


lim -—= lini - - —- (14) 

Rk^oa logmax(V(i?), 5(e)) Rk^^ logV(i?) 


IV. ADVERSARY JAMMING THE TRAINING PHASE 
In the previous section, we show that the adversary not 
jamming during the training phase does not degrade the 
performance of the multi user communication when the BS 
has sufficiently large number of antennas. In this section, we 
aim to find attack model that do degrade the performance. 
Specifically, we focus on finding an attack strategy capable 
of limiting secure DoF to an arbitrarily small value. Next 
theorem sheds light on finding such an attack strategy. 


Theorem 3. (A non-zero correlation between the estimated 
user channel and the adversary channel gains limits the 
maximum secure DoF to zero) Assume that there exists user 
k such that 

S an i.i.d random process. 

• For any B > 1, there exists a random vector Hjf that 
satisfies the following: 1) the joint probability distribution 
of H^,H^ is identical with that of F[^,H^, where 
= Hi ,..., ,..., H// and 2) the joint probability 

distribution ofH{i),H{i) is identical for any i G [1 : B]. 


Then, the maximum secure DoF is zero if E 

□ 


h: Hk 




Note that random vector is created Iw replacing Hjf 
in with . The proof of Theorem ^ can be found 

the gain of the connecting m-th antenna at the BS to the adversary. 














in Appendix C-A In the example given at the end of this 
section, we show that the assumptions listed in Theorem 
that are related to the random variables hold when MMSE 
and mutually orthogonal pilot signals are used as a channel 
estimation strategy. Note that such an estimation strategy is 
quite popular in the multi-user communication m- 

We next give a proof sketch. Assume that conjugate beam¬ 
forming is used at the BS and Me = 1. Note that Theorem 
1^ is also valid when the BS uses (5-conjugate beamforming 
and Me > 1. We can convert the communication set-up 
explained in Section to an identical set-up containing a BS 
equipped with K antennas, where the channel input signal at 
l-th antenna in the new set-up represents the data signal for Tth 
user Si, I = 1,..., K. Since conjuagte beamforming is used, 

the gain of the channel connecting /-th antenna to i-th user in 

Hi Hr 


the new set-up is 


and the gain of the channel connecting 


l-th antenna to the adversary in the new set-up is 
i,l = 1,..., K. Following the assumptions in Theorem]^ we 
show that the gain of the channel connecting the BS to the ad- 
versary can b^placed with ..., ..., 

In Appendix C-A we bound Ri 


Mai ’ ■ ■ ■ ’ V 

-fc as follows 


iife < E 


where Ai. = 


max (log (1 -I- Ak^Al) 
SG s 




-log(l + AeEA:))] 


(17) 


HkHT 


HkH* 


m —, • ■ • 1 11 ^ is 1 X AT com- 

^Mak ’ ’ Mok 

plex gain vector of channels connecting the BS to fc-th user. 


and Ae = 


HkHi 


HkH* 


is 1 X AT complex 


(fai ’ * ' ' ’ yjMak ’ * ■ ■ ’ y/MoK 

gain vector of channels connecting the BS to the adversary. Let 
E be the covariance matrix of input signal S' = ,..., Sk] 

and § be the feasible set for the maximization problem in ([T7|l. 
Every matrix E in set § is diagonal due to fact that Si,..., Sk 
are independent, and satisfy E ^ diag{pi,..., pk) due to the 
power constraint in (|^. 

We show that, if E [H^ iTe„] 7 ^ 0, then the right hand 
side (RHS) of ([^ over logM goes to zero as M —>■ oo. 
Hence, the maximum secure DoF becomes zero. □ 


Remark 5. (Adversary has to jam the training 
phase) When the adversary does not jam the training 
phase, Tffc and Fie independent and consequently 

E = E \HkJ\ E [HlJ\ = 0 for all 

k S {1,...,A'}. In order to have a non-zero correlation 
between the gain of the channel connecting itself to the BS 
He with Tffc for any k € {1,..., AT}, the adversary has to 
jam the training phase. Hence, the training-phase jamming is 
capable of limiting the maximum DoF to zero. □ 


In addition to limiting the maximum secure DoF to zero, 
the adversary can make the maximum achievable rate of k- 
th user arbitrarily small as M —7 oo. We next provide the 
conditions under which the maximum achievable rate of k-th 


user goes to a finite value as M —7 oo. 


Corollary 2. (A user’s maximum achievable rate is bounded 

as M oo) In addition to the assumptions given in Theo¬ 
rem^ assume that there exits a finite non negative r such that 
PKm i^) — ^ fi"" all M >1 and x G OCm, where Pkm 
probability density function of Km — and (Km 

is the sample space of Km. Then, the achievable rate of k-th 
user is bounded as 


lim Rk < 

M—>-oo 


log 



r 1 

2 

E 




r 1 

2 

E 




-I- 



□ 


The proof of Corollary can be found in Appendix C-B 


As seen in Corollary if the amount of correlation between 
the BS-to-fc-user channel gain and the estimated BS-to-A:-user 
channel gain, E is smaller than that between the 

BS-to-adversary channel gain and estimated BS-to-fc-th user 


channel gain. 


E 


of fc-th user vanis 


H^^Ht 


the maximum achievable rate 


les as M —>■ (X). 


Remark 6. (Resource race between the adversary and the 
user) We show that if there exists a non zero correlation be¬ 
tween the BS-to-k-user channel gain and the BS-to-adversary 
channel gain, then the maximum secure DoF is constrained 
to zero. Furthermore, we also show that if the amount of 
this correlation is higher than the amount of the correlation 
between the BS-to-adversary channel gain and estimated BS- 
to-k-user channel gain, the maximum achievable rate of k-th 
user goes to zero as M ^ oo. 

Hence, in the presence of the training-phase jamming, 
the achievable rates and the maximum secure DoF are 
determined as a result of the arms race between the 
adversary and users. □ 


Example 1. (Using MMSE and mutually orthogonal pilot 
signals for channel estimation) We study an adversary that 
chooses to match k-th user’s pilot signal on the training phase 
with one of its antennas when MMSE and mutually orthogonal 
pilot signals are used for channel estimation. We show that 
the assumptions given in Theorem 3 are valid under such a 
jamming attack and a channel estimation strategy. Then, we 
show that the maximum secure DoF is zero. 

We consider mutually orthogonal pilot signals 


i.e.. 


fik X (j)* 


TrPr if k = I 
0 if k I 


for any k,l G {1,..., AT}. The received signals at the BS in 
the training phase of i-th block is as follows: 




[Nyy^Hj+ Y, Hi + W, 
Y Pr 


where Pjam A the jamming power. Note that we assume that 






















































the adversary jams the data communication phase and the 
training phase with the same power, which is Pjam- 

In order to validate the assumptions listed in Theorem 
we next present the estimated gain of the channel connecting 
the BS to Tth user at i-th block as 


m) 


aHi{i) + bHe{i) + cVi if I = k 
dHi(i) + eVi if I f k, 



TrPr-\-l-\-Trf^ 


Define 


enojM) for any 

T ^ T'r-^PrPjatn. 

T,.pr + l+TrPja„, 

/Tkpk 


distributed as for any I 

Tr-Pr- 

Tr Pr + 1 -\-Tr Pj c 

1 A TrPr 

’ “ - T,p, + 1 

Stated in Theorem ^ 
cVk, i = 1,..., B. Further, define Hi = aHi 


as Hk{i) = bHk{i) 


aHe{i) 

bHe + cVi if k = I, and otherwise. Hi = dHi + eVi. Note 
that Hk(i), H(i), He{i), H{i) is an i.i.d process due to (0 
and the associated joint distribution is identical with that of 
Hi~, H, Hf,, H, where Hk = aHi; + bHf, + cl4. Hence, we 
conclude that the joint probability distribution of H{i),H(i) 
is identical for any i G {1,..., B}. 

We next show that the probability distribution of H^, H^ 
is identical with that of H^,H^. Note that both {He,Hk) 
and (BfcjBfe) are independent with Hence, noting 

that Hg and H^ have same probability distributions, it is 
sufficient to show that Hk\Hk = hk has the same probability 
distribution with Hk\Hg = hk for any hk G 


P(Bfc < 

x\Hk = 

hk^ 



= P {bhk 

(xHe. 

■F cVk < x\Hk = 

= hk) 


= P {bhk 

(iHq 

+ cVk < x) 


(18) 

II 


+ cVk < x) 


(19) 

= P {bHg 

. + aHk 

+ cVk < x\Hg -- 

= hk) 

(20) 


' (Bfe < x\He = hk^ 


for any x G where •HI and follow from the fact that 
He, Hk, and Vk are mutually independent and follows 
from the fact that {He,Vk) and {Hk,Vk) are identically 
distributed. 

Finally, note that {Hg,,,, Hk„,}m>i forms an i.i.d process 
due to the fact that Hk, Hg,Vk are mutually independent 
random vectors and each is composed of M i.i.d complex 
Gaussian random variables,. 


Note that E 


m Hg 


m Hg 


= b. Since E 

non-zero, we conclude that the maximum secure DoF is zero 
by Theorem □ 


V. Secure communication under Training-Phase 
Jamming 

In the previous section, we showed that massive MIMO 
systems are vulnerable to the training-phase jamming. In this 
section, we first provide a defense strategy against the training- 
phase jamming, that expands the cardinality of the set of pilot 
signals and keeps the pilot signal assignments hidden from the 
adversary. Then, we show that utilizing the defense strategy 


and 5-conjugate beamforming, the BS can satisfy the security 
constraints without using Wyner encoding in the presence of 
the training-phase jamming. Finally, we discuss that relying 
only on the computational cryptography, we can secure the 
communication of pilot signal assignments; hence the entire 
massive MIMO communication. 


A. Counter strategy against training-phase jamming 

We hrst describe our defense strategy against training-phase 
jamming attack. Then, in Theorem we show that the ratio 
of the achieved rate to the logarithm of number of antennas 
can be brought arbitrarily close to maximum achievable secure 
DoF of with the proposed defense strategy that will be 
explained next. 

The BS constructs pilot signal set $ containing L mutually 
orthogonal pilot signals, i.e., $ = {fi,..., fh}, where L is 
larger than the number of users in the system, L > K. Thus, 
the number the pilot signals is increased. At the beginning of 
each block, the BS draws K pilot signals from set $ uniformly 
at random and assigns each of them to a different user. Let 
= [fiii'i), ■ ■ ■, fiKii)] be K pilot signals that the BS 
picks at the beginning of f-th block, where fkii) € ‘h is the 
pilot signal assigned to fc-th user on f-th block. 


Throughout sections V-A and V-B we assume that the BS 


communicates to the users the assignments of pilot signals 
reliably while keeping the assignments hidden from the ad¬ 
versary. In Section [V-C[ we discuss how this can be achieved. 
In particular, we consider computational cryptography as a 
way to communicate the pilot signal assignments and discuss 
the notion of security achieved. 

We next describe the attack model in detail, under the lack 
of knowledge of the pilot signal assignments. Suppose that 
the adversary targets A:-th user without loss of generality. The 
adversary eavesdrops the entire communication between user 
k and the BS and simultaneously jams data communication 
phase with Gaussian noise as in no training-phase jamming 
attack model. Furthermore, the adversary, without knowing 
which pilot signal is assigned to which user, picks J < L 
pilot signals uniformly at random from set <1> at the beginning 
of a block and subsequently jams these pilot signals with an 
equal power during the training phase. The adversary repeats 
this process independently at the beginning of each block. 

Particularly, the adversary divides its jamming power and 
transmits an equally weighted combination of J randomly 
selected pilot signals using all of its Mg antennas with total 
transmission power The signal received by the BS 

during the training phase under this attack model can be 
written as follows: 


K 






14>i 


1=1 


Me 

EE 

ZG3 n=l 


Pja 


Mgjp, 


■Hjji+W, (21) 


where Y'^’’ denotes M x Tr complex matrix of the received 
signals over channel uses at the BS, Hg^ is 1 x M complex 
gain vector of the channel connecting n-th antenna at the 
adversary to the BS, and d is the set of pilot signals that are 


















selected and transmitted by the adversary at the corresponding 
block. Note that 3 is a random set that can possibly change in 
each block and \3\ = J■ 

Next theorem shows that when the cardinality, L of pilot 
signal set is increased as a function of the number of BS 
antennas in a certain way, the ratio of attained secure rate to 
logM for any user can be arbitrarily close to the maximum 
DoF attained in the presence of no adversary. 


Theorem 4. (Achievable rate under training-phase jamming) 

For given block length T and data transmission phase length 
Til, the achievable secure rate, under training-phase jam¬ 
ming satisfies 


Rk 

\ogM 


> 


Td 

T 


min(l, 7 ) 


e 


( 22 ) 


for any k € {1,... ,K}, J € {1,... Tr}, e > 0, and j > 0 if 
, K) < Tr and M > G(e), where 


~ ( ( 1 + MePmax + 


Pmax Pj a 


^iPf Pjam “t“ 1) X 


Pr + Pj am “1“ 1 
PminPr 


(23) 


Pmax = pk, and Pmin — rnin;j.^|]^ 2^1 pi^. n 
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Fig. 5. The change of G(e) with e 
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Note that the lower bound to in ( |22] l does not 

depend on how many pilot signals the adversary chooses 
to contaminate. The proof of Theorem can be found in 
Appendix 

Remark 7. (Attained is arbitrarily close to maximum 
DoF under no attack) We can observe from the statement of 
Theorem that when 7 = 1, that is arbitrarily close to 

the maximum DoF attained under no attack can be achieved. 
In order to attain that amount of the length of the 

training phase Tr is expanded so that Tr > max(A', G{e)) for 
given e > 0 and the size of pilot signal set is set to Tr instead 
of K. Hence, we sacrifice the some of secure throughput by 
increasing the training overhead. However, as illustrated in 
the next example, the typical values for the block lengths for 
mobile wireless communication systems is sufficiently large to 
keep the overhead ratio, ^ reasonably low. 

Example 2. In this example, we consider massive MIMO 
downlink transmission to users moving at a speed 10 m/s 
and the transmitted signal bandwidth is 10 MHz, centered 
at 1 GHz The associated coherence time corresponds T to 
as 3 X 10^ channel uses. We first evaluate the number of 
antennas required to keep Rk in e neighborhood of ^ for 
a given training phase length Tr. To that end, we plot the 
variation ofG{e) with e in when 7 = 1, = 2x 10® 

channel uses, pjam = 1, K = 5, pf = 5, Me = 1, Pr = 10, 
Pk = I for all k € {1,..., AT}. For these set of parameters, 
200 antennas are sufficient to keep larger than ^ — 0.3, 
where ^ = §■ 

Next we study the trade-off between e and ^ for given 


M, where e is the deviation of achieved from ^ as in 
( |22[ ). To that end, we plot the variation of e and ^ with ^ 
for M = 200 as we change ^ from to 1. The values 

of parameters Pjam, K, M^, pk, and pf are kept same as 
stated above and we set pr = ^pf. As seen in Figure e 
vanishes as Tr goes to T and hence also gets closer to 

However, as Tr increases, the training overhead increases 
and hence maximum DoF ^ decreases. 

Remark 8. (Resource race between the adversary and the 
BS) By keeping the pilot assignments hidden from the adver¬ 
sary and using a pilot signal set that scales with M, the BS 
converts the arms race between the adversary and the target 
user (which was the case with known pilot assignments), 
back to the one between the adversary and itself. Indeed, the 
power of the adversary needs to scale with L for it to make 
an impact. 


B. Establishing security without Wyner encoding 

In this subsection, we show that the BS, when utilizing 
deterministic encoding instead of stochastic encoding is still 
capable of satisfying the secrecy and decodability constraints 
in the presence of training-phase jamming. Hence, this sub¬ 


section can be considered as the counterpart of Section III-B 


There, we assumed no training phase jamming, whereas here 
we mitigate training-phase jamming by other means. 

In order to satisfy the security constraints without using 
stochastic encoding, the BS employs 5-conjugate beamforming 
given in and the strategy explained in Section |V-A 


Specifically, Theorem and Corollary provide the number 

















of antennas that the BS requires in order to satisfy only 
the secrecy constraint and both the secrecy and decodability 
constraints, respectively. Note that Theorem]^ and Corollary]^ 
are the counterparts of Theorem and Corollary [T] 


Theorem 5. (Establishing secrecy with no stochastic encod¬ 
ing) Let (5, 7 > 0, and 7 + i5 > 1. Let block length be T 
and length of data transmission phase be Td. In the presence 
of training-phase jamming, for any e > 0 and any rate tuple 
R = [Ri,...,Rk\, if M > Si(e) and Tr > 
then any deterministic code ..., em¬ 

ploying 5-conjugate beamforming satisfies 


BT 


-M 


(24) 


for any J S {1,..., Tf\, B >1, and fc € [1 : K], where 


Si{e)^ 




Mp max 




min(5, (5 + ’7 — 1) 


2^<i - 1 


and Pmax 


A 


maxfeg{i^ Pfe- 


□ 


The proof of Theorem can be found in Appendix E-A 


Note that when 7=1 and 1 > the necessary number of 
antennas to meet the secrecy constraint under training phase 
attack becomes identical to that under no attack. This result 
demonstrates the effectiveness of the defense strategy, hiding 
the pilot signal assignments from the adversary and expanding 
the pilot signal set. 


There is a tradeoff between the number, M, of antennas 
and the length, T^, of the training period necessary to satisfy 
constraints M > S'i(e) and > max(M''', K) . This tradeoff 
is controlled by parameter 7. While choosing 7 close to 1 
minimizes S'i(e) for any e > 0, it increases the length of the 
training period, i.e., the overhead. To observe this: First, S'i(e) 
is minimum at 7 = 1 due to the fact that min((5, i5 + 7— 1 ) < (5 
and equality occurs when 7 = 1 . Second, increasing 7 to 1 
also increases training overhead as Tr has to be larger than 
Si{e)T 

In Theorem we provide the number of antennas required 
to satisfy only the secrecy constraint. Next corollary presents 
the number of antennas that BS needs in order to satisfy both 
the secrecy and the decodability constraints without need for 
stochastic encoding. 


Corollary 3. (Any rate tuple is achievable with no need 
for stochastic encoding) Let 0<(5<1, 7 + (5>l. Let 
block length be T and length of data transmission phase be 
Td and J be any integer in T^}. In the presence 

of training-phase jamming, for any e > 0 and any rate 
tuple R = [Ri,...,Rk], if M > max (Vi(i?), S'i(e)) and 
Tr > M~^, then there exists B(e) > 0 and a sequence of 
codes , ■ ■ •, 2^^^*^, BTd) , B > B(e) that satisfy the 

constraints in and ([TT]i without a need for stochastic 


encoding, where 
Vi{R) = max 


□ 

The proof of Corollary can be found in Appendix |E-B| 

C. How do we hide the pilot signal assignments? 

So far, we have demonstrated that, if pilot signal assign¬ 
ments can be kept secret from the adversary, the impact of 
training-phase jamming can be mitigated by increasing the 
cardinality of the pilot signal set at the expense of some 
increase in training overhead. Next, we discuss how to keep 
the assignments secret from the adversary. 

In order to communicate the pilot signal assignments se¬ 
curely, at the beginning of each block, the BS shares with 
each user a secret key of size logL bits, that is unknown 
to the adversary. In the literature, by far the most popular 
way to generate an information-theoretically secure secret 
key across a wireless channels is via the use of reciprocal 
channel gains ini, m, m. However, we cannot use such 
channel-gain based methods, since for those methods we 
need to observe the channel gains. However, our objective 
of generating the keys is to secure the training phase, whose 
sole purpose is to observe the channel gains in the first place, 
leaving us with a “chicken or the egg” dilemma. 

With this observation, let us consider the methods in which 
these keys are generated and shared by standard private key 
based methods (e.g., Diffie-Hellman ifTSl l or public key based 
methods (e.g., RSA mi Thus, it only relies on existing 
standard computational cryptographic techniques and does 
not rely on information-theoretic techniques for secure key 
sharing. Note that a shared key between the BS and a user 
is used to encrypt the pilot signal assigned to that user 
and the encrypted assignment is communicated to the users 
immediately after key sharing. 

Despite the use of computational cryptographic methods for 
key generation, the security we provide has the “same flavor” 
as information theoretic secrecy, as we clarify next. The main 
drawback of computational cryptographic methods such as 
Diffie-Hellman is that, they make assumptions on the com¬ 
putational power of the adversaries. This kind of security is 
based on the supposition that, given that the key is hidden from 
an adversary via a difficult puzzl^ it takes an unreasonable 
amount of time for an adversary to crack it. Nevertheless, given 
enough time, the adversary will eventually decrypt the message 
(possibly quickly, given a quantum computer, for instance). 
This constitutes the main motivation for information-theoretic 
security, which makes no assumptions on the computational 
powers of the attackers. 

®For example, RSA is based on an NP problem: prime factorization of a 
large number. 


\Pf “t” Pjam “f 1) X (pr + Pjam “t” 1) 
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In our approach, we have a hybrid scheme, combining 
information theoretic security and computational cryptogra¬ 
phy. We are using cryptography to hide the pilot sequence 
assignments, not the message. Encrypting the pilot signal 
assignments is fundamentally different from encrypting the 
message. In message encryption, the signal received by the 
adversary remains vulnerable to cryptanalysis, long after the 
message is transmitted. On the other hand, with pilot signal 
assignment encryption, this window of time for cryptanalysis 
can be arbitrarily small; unless the adversary figures out the 
pilot sequence assigned to the targeted user before the training 
phase starts, the knowledge of the assignment becomes useless. 
But, we know that the training phase starts immediately after 
the encrypted assignment is communicated to the users. If we 
define the computational power required for the adversary as 
the ratio of amount of computation needed to decrypt the key 
via cryptanalysis to the time required to solve the problem, 
the computational power necessary for the adversary to make 
a damage on the targeted user goes to infinity. This addresses 
the shortcoming of existing cryptographic methods due to their 
assumptions on computational powers of adversaries. Note 
that, if the adversary cannot act during the training phase, 
the message transmission is “perfectly secure” as shown in 
Theorem 4. 

It is important to emphasize that, in the above discussion, 
we did not show that the aforementioned defense strategy 
achieves information-theoretic security. Instead, we argued 
that, utilizing our defense strategy of encrypting training sig¬ 
nals, we can avoid one of the main drawbacks of the existing 
computational-cryptographic methods, i.e., assumptions on the 
computational power of adversaries. 

VI. Conclusion 

In this work, we study the physical-layer security of mas¬ 
sive MIMO downlink communication. We first consider no 
training-phase jamming attack in which the adversary jams 
only the data communication and eavesdrops both the data 
communication and training. We show that secure DoF at¬ 
tained in the presence of no training-phase jamming is as 
same as the DoF attained under no attack. This result shows 
the resilience of the massive MIMO against adversaries not 
jamming the training phase. Further, we propose a joint power 
allocation and beamforming strategy, called (5-conjugate beam- 
foming, using which we can establish information theoretic 
security without even a need for Wyner encoding as long as 
the number of antennas is above a certain threshold, evaluated 
in the sequel. 

We next show the vulnerability of massive MIMO systems 
against the attack, called training-phase jamming in which the 
adversary jams and eavesdrops both the training and data com¬ 
munication. We show that the maximum secure DoF attained 
in the presence of training-phase jamming is zero. We then 
develop a defense strategy against training-phase jamming. 
We show that if the BS keeps the pilot signal assignments 
hidden from the adversary and extends the cardinality of the 
pilot signal set, a secure DoF equal to the maximum DoF 


attained under no attack can be achieved. We finally provide 
a discussion why standard computational-cryptographic key 
sharing methods can be considered as strong candidates to 
encrypt the pilot signal assignments and how they achieve a 
level of security that is comparable to information-theoretically 
secure key-generation methods. 


Appendix A 
Proof of Theorem[T] 


We first evaluate an upper bound on the secure DoF. In 
order to derive an upper bound, we assume that there is a 
single user and no adversary in the system. Further, we assume 
that the user and the BS have a perfect information of the 
channel gains. As a last assumption, the user is assumed to 
know the received pilot signals at the BS. Hence, with these 
assumptions, the communication model in Section 2 reduces to 
a multiple input single output (MISO) communication set-up 
in which the channel gains and pilot signals are available at the 
BS and the user. Note that the capacity, the supremum of the 
achievable rates, of this new set-up upper bounds the secrecy 
rates achieved under the communication set-up explained in 
Section |I^ We derive the capacity with the following analysis: 


C = max 
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max --^I(X\Y\F[i) 

p{x\hi),lK[tr{XX*)]<pf T 




yT'^\h. 
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= max —Enog(l-|-P(i/i)||iTi|p)l 

E[Pim)]<pj T LSI V nil 111 


(26) 

(27) 

(28) 


where X'^‘‘ is a complex x M matrix and P(-) : —> 

K+ U {0} is a power allocation function. In the derivation 
above, ( [25] l follows from Section 7.4.1 of EOl where the 
capacity of a communication system in which the channel 
gains are available at both encoder and decoder is stated. The 
equality in ( |26l l follows from the fact that Y^’’ X'^'^, Fli —> 

forms a Markov chain and the equality in ^TT) follows 
from the fact that 

, N 

I |7Ti j < /(X,; Y,|iJi) (29) 

and from the fact that the equality is attained in ( |2^ if 
Px-^diHi Then, the RHS and 

the FHS of @ becomes /(X; Y|iJi). 

In ( |28| l, the equality follows from Section of ED, where 
the capacity of MISO system is evaluated. In ETIl . the power 
allocation function maximizing ( |28] ) is given as 

P(/ii)=(am-p^) , 



where Am is a non-negative real number and is chosen such 
that E[P(i/i)] = Pf- We next find an upper bound on Am 
with the following analysis; 
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where ([30ll follows from the fact that 
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generation: 
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Rk = log 1 -f 


Mpka 


— El for some ei > 0 and 


-1 - y Pf+Pj+1 y 

for k = 1,..., K. Generate K codebooks, Cfc, k = 1,..., K, 

where K is the number of users. Codebook Ck contains 

BTd 


\Hi\ 


is distributed with independently and identically generated codewords, s 


inverse Gamma distribution and has a mean of jjzrf- Hence, 
we have Am < m^ + P/ ^or M > 1. We next bound the 
DoF of the MISO communication system as 
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can be at most 
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if k^l 


Hk — dHk + bVk 
for k G iG}, where a = 


b A ^Pr-Tr- 


Note that E 
for any k 


Hk 


= 0 i 


xM, 
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\\Hkf 


kl ’ 

I G { 1 ,..., 2 ^'^^''}, each is drawn from pklsTd)- 

Encoding: In order to send fc-th user’s message 
Wk S Wfe, the encoder draws index Ik from the 
uniform distribution that has a sample space of 
I {wk - 1) + 1,..., Note 

that this mapping makes the encoder stochastic. The encoder 
then maps index Ik to the corresponding codeword 3^^“^ in 
codebook Ck- 

The encoder employs a conjugate beamforming to map 
codewords to channel input sequence The channel 

input at j-th channel use of i-th block can be written as 
follows: 


(34) 


(35) 
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H*k{i) 


where ak = a for all k G K} due to the fact that 
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\Hk 


= a for all k G {1,..., K}. 


where ( |32| l follows from the fact that P {■) < Am for all 
realizations of Hi, ( |3?| l follows from Jensen’s inequality, and 
follows from (1^. In we show that secure DoF 
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Decoding Each user employs typical set decoding ifT^ . 
Let be the received signal at fc-th user over BTd 

channel uses. The decoder at fc-th user looks for an unique 
index Ik G {l,..., 2®’^'^'^'=} such that G 

where eJ’") is the set of 

jointly typical sequences (Sk^'^,yk^‘‘) with 


Next, we describe an achievability strategy to attain secure 
DoF of 


Channel estimation; Pilot signals are mutually orthogonal, 
i.e., 

TrPr if k = I 



+ Hi 


3 3 


jam.k^jam 


Vk 


where is distributed with CN( 0 ,pfc/T^), j = 1 , 
and Vk is distributed with CN ( 0 , Ixd)- 




for any k,l G {I,..., K}. The BS employs MMSE for chan¬ 
nel estimation. The estimated gain of the channel connecting 
the BS to fc-th user is as follows; 


(36) 

and 


Vk is additive Gaussian noise distributed with ^^(Oj/m)- 


Probability error and equivocation analysis By the 

channel coding theorem HU, E[Pe] ^ 0 as P ^ oo if 
Rk < ^ where expectation is 

over random codebooks, Ci,..., Ck- Note that codebook Ck 
is the realization of Ck- Define 

^ " " ^HkH*k 


= Ma. Eurther, 


K] and for any m,n G [1 : M], 
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= f log(l 
= f log(l 
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where = 






Signal set 5^^'' = 


is defined to be the transmitted codewords of 


Note that E [Tq] = E [Ti] = E [r 2 ] = E [T 3 ] = 0 and 
E[Tor*] = E[Tor*] = E[ror 3 *] = O. We can bound 
^i(sI\Y^A as 


(37) 

(38) 

(39) 


Pf ~t” Pjam T 1 

where ( |J7| ) follows from Theorem 1 of ifTSl and ( |38| l follows 
from the fact that Ti, To, and T 3 are uncorrelated random 
variables. The equality in ( [39] l follows from the fact that 
Var [To] = Mpka, Var [Ti] = pk, Var [T 2 ] = J^j^kPY 
and Yar [T 3 ] = pjam + 1- From ( [39l l, we conclude that 
Rk<^I . Hence, E [P^] ^ 0 as 5 ^ 00 . 

We next analyze the secrecy constraint in ([^. Let 
H {Wk\ be the expectation of the 

conditional entropy in over random codebooks C = 
[Cl,, Ck]- We show that the expectation satisfies the con¬ 
straint in o for fc-th user with the following analysis: 

H{Wk\Z^^,G^,e) > H{Wk\Z^^,S^^YG^,e) 

= H{Wk\ z ^^-^, 3 ^^“, , e) (40) 

= H(Wk, Z^^YS^^YG^,ej 
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> H (s'f, G^, e) 

- 77 I IVfc, G®, e) 
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= 77 Cfc) - 7 Z-®^" 3^^YG^, c) 

- 77 (S'f Wk,Z^'^YS^'^YG^, e) (41) 
= BTRk - I ; Z®^-^ 5^^", G^, c) 

- H (^3^^“ VFfc, Z-®^V G®, e) (42) 

> BTRk - 7 C; Z^^-^ G^) 


the users except fc-th user. Signals Z^^’’ and Z^^"^ are the 
received signals at the adversary over the training phases and 
data communication phases, respectively. Note that Z®^ = 

[Z^T^ ZBT,y 

In the above derivation ( |40l ) follows from the fact that 
ZBT,. (G®, VFfc, C) are independent, ( |4T] ) 

follows from the fact that Cfc) are independent with 

(G®, {Ci}j^^.), and ( |42| l follows from the fact that 3^'^'^ is 
uniformly distributed on a set of size . We continue the 

derivation as 

(|4^ = BTRk - 7 (S'f Z-®^" 3^^YG^'j 

- 77 (s-f VFfc, Z-®^V G^, C) (44) 
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-h{s^'^^ Wk,Z^^,3^^‘^,G^,e 
= BTRk - BTd log (1 -f MePk) 

- 77 IVfe,Z^^, 3^'^YG^, c) (46) 

> BT (^Rk - ^ log (1 + MePfc)^ - BTeo (47) 

= BT{Rk-e) (48) 

for any £2 > 0 and sufficiently large B, where e = ei -f £2 
and He^ in ( |45] ) denotes the gain of the channel connecting 
m-th antenna at the BS to the adversary. The equality in ( |44| ) 
follows from the fact that C —>■ 3'^‘‘^ , 3'^'^^ ,G^ —)■ Z^''^ 

forms a Markov chain. The equality in ( |46] l is due to the fact 
'' 21 
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To get the inequality in we need to bound 

^ H(^3j^^^\Wk,Z^^,3^^YG^,e). Define R, ^ Rk- 

Rk- Note that Re 

^ log (1 + MePko). Hence, as in (52) of iflTlI . utilizing Fano’s 
inequality and the channel coding theorem, we show that 
lims^oo H [ 3 ^^^ I FFfc, Z^^, G, c) = 0. 

From the fact that E [Pe] —)■ 0 as 7? —)■ 00 and from ( |48] l. 










































we conclude that there exists a sequence of codes satisfying 
constraints ([^and ([^. We now evaluate degree of freedom 
dk associated with Rk as 


dk 


y Rk 

hm --— 

M-^-oo log M 


^ + J™ 7T7log(l + MePfc) 

1 M^oo 1 
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for k = 
Td 

T ■ 


I,... ,K. Hence, the attained secure DoF is equal to 
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Appendix B 

A. Proof of Theorem 

Note that since the adversary keeps silent during the training 
phases, the received signals at the BS over training phases 
are independent with H^. Hence, we conclude that = 




K 


and iJg are independent. 


The BS picks message rates Rk > 0, k = l,...,K. 
The equivocation rate for a code ,..., 2®^^^, BTfj 

utilizing deterministic encoding mapping functions, fk, k = 
1,... ,K and (5-conjugate beamforming is as follows; 


— H{Wk\Z^^,G^) 
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straint in 

In the above derivation, ( |49l l follows from the fact that 
and {Z^'^'^ ,G^,Wk) are independent, @ follows from the 
facts that Wk is independent with , G^ and uniformly 
distributed on [1 : 2^^^'=]. In 0 , the inequality follows 
from the fact that Wk —>■ ,G^. 

In ( [5^ , Pk(i,j) = E I|S'fe(i,_))|p], where the expectation is 
over Wk- In ( |5^ , the inequality follows from Jensen’s inequal¬ 
ity and from the fact that ^ Y.J=t^+i Pk{i,j) < Pk- 
In (|54ll, the equality follows from the fact Hk and are 


independent and E 


H*kHe 


Mak, k = 1,..., K. 


B. Proof of Corollary 

Pick 0 < (5 < 1. Pick arbitrary e > 0 and rate tuple R = 
[i?i,..., Rk]- Let M > max(F(i?), 5'(e)). Note that inequal¬ 
ity M > F (i?) implies that Rk < ^ log -f ’ 

k = 1,...,K. We first show that there exists B{e) > 0 
and a sequence of codes ,- - -, 2 ^^^^, BTf) utilizing 

(5-beamforming and deterministic mapping, that satisfy the 
decodability constraint in ( [T 0 | ) for B > B{e). 

The same channel estimation strategy in Appendix A 
is used. Codebook generation is as same as the one in 
Appendix The BS generates K codebooks, Ck, k = 
l,...,Ar, where Ck contains 2 ^^^*= codewords, I G 

To send fc-th user’s message Wk G Wk = {l,..., 2®^'^'“}, 
the BS maps message Wk to the corresponding codeword 
^kwk I*' codebook Ck- Note that there is no randomization 
in the mapping as opposed to the mapping in the encoding 
in Appendix where the codeword is a stochastic function 
of the message. The BS employs (5-conjugate beamforming to 
map codewords to channel input sequence The channel 

input at j-th channel use of i-th block can be written as 


where ak = a and a is defined in ( |36l l. 

The typical set decoding is used at each user as in the 
proof of Theorem in Appendix Hence, since Rk < 
^ = 1,..., AT, by the channel coding 
theorem, there exists a sequences of codes that satisfy con¬ 
straint 

In addition, since M > S(e), the sequence of codes 
mentioned above satisfy the secrecy constraint in ([n) due to 
Theorem]^ Hence, the proof of Corollary [T] follows. □ 
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for any e > 0 and for sufficiently large M, where G = 
. Particularly, for a given e > 0 if M > 
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, then there exists a code that satisfies the con- 


A. Proof of Theorem 

Throughout the proof, we assume that the BS employs 
conjugate beamforming without loss of generality. Suppose 
that Rk is an achievable rate. From the constraints 



























and Fano’s inequality, we have 

-^H(Wk\Z^^\H^.H^.H^)>Rk-5B (56) 

-^H{Wu\Y;^^^)<eB (57) 

where cb and 5b go to zero as B ^ oo. 


The LHS of (|5^ can be written as follows 
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for 1 < i < B and Td + 1 < j < T. The 
equality in ( |58| ) follows from the fact that —> 

Z^Td ^ jjB ^ jjB forms a Markov chain and the equality 

in ( |59| ) follows from the fact that the joint distribution of 
Wk,Z^'^‘^,H^, Hi ,..., ,... is identical with that of 

VFfe, iJf,..., iJf,..., The equality in @ 

follows from the fact that /H^ —>• Z^^'^,Hi, Wk 

forms a Markov chain. 


The upper bound on Rk can be derived with the following 
steps: 
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In the derivation above, 
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follows from ( |56| l arid ( |57 
and ( |64l l follows from the fact that conditioning reduces the 
entropy. The inequality in ( |65] l follows from the fact that 
Wk , Z^G^ . The inequality in ( |6^ 

follows from the memorylesytroperty of the channel and from 

stating that ^iT(z), iT(z)^ 


the assumption in Theorem 


have 


an identical probability distribution for any z > 1. We continue 
the upper bound derivation with the following steps: 
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. Further, and Cg are the 
tively. Define 'Eij as K x K 
covariance matrix of Sk{i,j)- Note that is a diagonal 
matrix due to the fact that each component of S{i,j) are 
independent. The inequality ( |68] l follows from (41) of 1231 . 
where 5 'g(z, j) in (|68ll is distributed with CN (O/f xif j ^ij)- 


Define /(S,j) = I (Scii, j)-,Ykii, j)\ Z{i, j), gY The in¬ 
equality in ( [69] l follows from Jensen’s inequality and Propo¬ 
sition 5 of 1^ that states /(Ey) is a concave func¬ 
tion of Ey. Note that Sq in ( [6^ is distributed with 

CN ^0, J2i=i Yg=t,.+i ■ The inequality in ([^ fol¬ 
lows from (139) of l23l . where S is a set of covariance 
matrices and defined as 

§ = {E : E ^ diag (pi,..., pk) 

and E is a diagonal matrix} (72) 


We can rewrite the random variable inside the expectation as 
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with probability 1, where Pk{G) is defined to be fc-th element 
on the diagonal of E, i.e., E = diag {pi{G),..., pk{G)). 
Note that 

0<Pz(G)<p,, l = l,...,K, 
due to ( |72] l. In ( |74l i, we define 
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for any M > 1 since the second log(-) term in ( [8T] l is non¬ 
negative. We next upper bound E [f+(M)] as follows: 
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[log (]g + ax) — log (]g -f bx)] ^ is non-decreasing if a; > 0 
where a and b are non-negative real numbers and M > 1. 
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where CZ) follows from and from the fact that •ID = 
( |7^ with probability 1 and ( |78| ) follows from the fact that 
liniB-^oo 7 b = 0. 
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as follows 
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where dShll follows from the fact [1 is a continuous function. 
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with probability 1. Further, we have 


Hk^Ht 


(90) 


lim log ( ^ + PfeWfc(G) ) = log ( lim PkWk{G)) 
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= log(^ 
\OLk 
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(91) 


lim Rk = lim — E 
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‘“*ls 


PkVk{G) 


-log ( —+ PkWk{G) 


+ 


= —E 

T 


lim 

M—>-CJO 


log ( ^+PkVkiG) 


log ( j^+PkWkiG) 


where pi) follows from the dominant convergence theorem. 
To apply the dominant convergence theorem, we need to show 
that random variable 


g{M)^ 


log 


+ PkVk{G) 


-log 


— + PkWk{G) 


with probability 1, where ( |88l ) follows from the strong law of 
large numbers. In a similar way we can show that 


is upper and lower bounded by random variables that have a 
finite limit for M > 1. Note that g{M) is lower bounded by 
zero and upper bounded by 


follows from the fact that the 
identical with that 


9{M) < 


< log (2 


log 


^ + PkVk{G) 


-log 

PkVk{G) 


— + PkWk[G) 


PkWk{G))-\og{pkWk{G)) (93) 


with probability 1 for any M > 1. Noting the analysis in i 

in order to show ( |93] l is upper bounded by a random 
variable that has a finite expectation, it is sufficient to show 
the expectation of second log( ) term in ( |9^ has a finite lower 
bound. Hence, 


(94) 


with probability 1. The equality in ([87| follows by combin¬ 
ing ( |90l l and ( |M] ). Hence, the proof ends. 

The proof of Theorem for the case in which the BS 
employs (5-conjugate beamforming can be done in the similar 
way. One only needs to replace CfcSc^ and CeSc* in ( |70| ) with 
and j^CeSc*, respectively and change the rest of 
the proof accordingly. 

□ 


B. Proof of Corollary 

Assume that the BS employs conjugate beamforming with¬ 
out loss of generality. Note that from ( |78l l, we have following 
upper bound; 


(95) 


E [log {pkWk{G))] = log pk + E [log (i(;fc(G))] 

= logpfc -h E [log (■u;fe(G))] 

= log pk - log Qffc -h E [log {Km)] 

> log Pk - log + y ^og{x)pKM (x) dx 

> log Pk - log ak+ \og{x)pKM (x) dx 

Jo 

> log Pk - log Ok + / log{x)rdx 

Jo 

= log Pk - log ak-r log e 

> — 00 , 

where r is defined in the statement of Corollary [^ In ( [94l i, 
the equality follows from the definition of Km in Corollary [^ 
and from the fact that the joint probability distribution of 
(^HgjHk^ is identical with that of (^Hk^Hk^- In ( |95] l, the 
inequality follows from the assumption in Corollary The 
rest of the proof follows from Appendix |C-A 


The proof for the case the BS employs (5-conjugate beam¬ 
forming follows from the same argument at the end of Ap¬ 
pendix C-A □ 


Appendix D 
Proof of Theorem|4] 

The length, of training phase has to be identical to at 
least the size of the pilot signal set L so that the BS can 
generate L > K mutually orthogonal pilot signals. Let J 










































Appendix E 


be any integer in set {l,...,Tr,}. In order to estimate Ic-th 
user’s channel, the BS first projects the received signal during 
the training phase indicated in ( [2l] i to cfik- Then, the BS 
normalizes the projected signal and estimates the gain of the 
channel connecting the BS to fc-th user at i-th block as 


Hk{i) = xi 


TrPrHkii) 

Me 

+n.E 


n—1 


TrPjan 

~Kj 


'-H,{i) + Vk 


(96) 


for any k € where E \\Hk{i) 


= 1 , Vk is 


distributed as 6 ^ 1 ( 0 ,/m) for any k G xi = 

^ and {ni}i>i is an i.i.d Bernoulli pro- 


M^TePe + l+Te^^ 

cess, where P)!!^ = 1) = ^. Event {11^ = 1} indicates that 
the set of pilot signals the adversary contaminates at f-th block 
contains fc-th user’s pilot signal. 


Utilizing stochastic encoding and conjugate beamforming 
as in the proof Theorem [T] we can show that rate 

Mpka 


Rk = 


Y log ( 1 + 


Td 


Pf + Pjam + 1 


-—log ( 1+MePk + 


MMePkPj am^ 
Lpr 


+ 


(97) 


for any fc G is achievable, where a = 

-P 77 r=r • Notice that the rate in (|97li does not depend 

on J. We can rewrite Hk as 


Rk = 


Y log ( 1 + 


Td. 


MpkPrTr 


(pf + Pj am “b l)(Pr7r + Pjam “b 1) 


-l^log ( l+MePk + 


MeMpkP^ 


jam 


Pr-Tr “b Pjam “b 1 
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(98) 


due to the fact that L = T^. Suppose 7 < 1. We bound Rk as 
follows 




log 1 + 


MpkPrM^ 


(pf + Pj am "b ^){PrM'^ + Pjam + 1) 


- log 1 + MePk + 


MeMpkP^ 


jam 


PrM'y 


- J. 


log M + log 
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iPf + Pj am + l)(fcr +Pj am + 1) 


-(1 - 7 ) logM - log ( 1 + MePk + 
= 
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M, 


Pk Pj a 


1 + 


7 logM - log 1 + MePk + 


MePkPj am 


^ (Pf + Pjam + 1) 


Pr + Pj am 1 


PkPr 


+ 


( 100 ) 


where ( |W| follows from the fact that Notice that 

the second logarithm term in ( |100[ ) does not depend on M. 
Hence, we observe that > ^7 — e if M > G(e). In a 

similar way, for 7 > 1 , we can show that ^ — c if 

M > G(e). 


A. Proof of Theorem 

We set the size of the pilot signal set L to T).. Let J be 
any integer in set {1,, T^}. The BS uses the same strategy 
explained in the proof of Theorem in order to estimate the 
gains of channels connecting the BS to users. 

The BS picks arbitrary message rates Rk > 0, 

fc = The equivocation rate for a code 

( 2 BTR 1 ^ ^ ^ BTd) Utilizing deterministic encoding 

mapping functions, fk, k = 1,K and (5-conjugate beam¬ 
forming is as follows: 




> 7?fc - ^ log ( 1 -b 
= ^ log ( 1 -b 


MePk M^ ^MePkPj am(^ 

~W~ ^ Lir 

MePk M^-^MePkPj a 


> ^ log ( 1 -b 


MePk 
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PrTr -b Tr -b 1 

l-S--y MePkPj am 
Pr 


( 101 ) 

( 102 ) 


(103) 


for all k G {1,..., AT}, where a is defined in ( [QT] ) and Hk(i) 
for any fc G {1,..., K} and i S {1,..., B} is given in ( [96l l. 
In the above derivation, ( [T0T] l follows from a derivation that 
is similar to (|49|l-(|5^ in Appendix B-A ( |102| l follows from 
the fact the cardinality of pilot signal set L is chosen as Tr 
and ( |103| l follows from the fact that Tr > M'^. 

As (5 -b 7 > 1, the RHS of ( [TO^ goes to zero as M ^ 00 . 
Eor any e > 0, M > S'i(e) implies that right hand side of 
(|103|l is smaller than e, completing the proof. 


B. Proof of Corollary 

We set the size of the pilot signal set L to Tr- Let J be 
any integer in set {1,..., T^}. The BS uses the same strategy 
explained in the proof of Theorem in order to estimate the 
gains of channels connecting the BS to users. 

Pick 5 and 7 such that 0 < (5 < 1 and 7 -b (5 > 1. 
Pick any arbitrary e > 0 and arbitrary rate tuple R = 
[i?i,..., Rk]- Choose M such that M > max(Vi(i?), S'i(e)). 
Note that inequality M > Vi{R) implies that Rk < 
^ log (1 + all fc G { 1 ,..., a:}, where 

a is defined in ( |97| l. As in the proof of Corollary we 
can show that there exists B(e) > 0 and a sequence of 
codes ,..., 2 ^^^^, BTd) that satisfy the decodability 

constraint in for B > B{e), when (5-beamforming, 
combined with deterministic mapping is used. In addition, 
since M > S'i(e) and R > M'^, following Theorem 
the sequence of codes mentioned above satisfy the constraint 
in completing the proof. 
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